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Abstract. We give an iterative algorithm for phase estimation of a parameter 6, 
which is within a logarithmic factor of the Heisenberg limit. Unlike other methods, we 
do not need any entanglement or an extra rotation gate which can perform arbitrary 
rotations with almost perfect accuracy: only a single copy of the unitary channel and 
basic measurements are needed. Simulations show that the algorithm is successful. We 
also look at iterative phase estimation when depolarizing noise is present. It is seen 
that the algorithm is still successful provided the number of iterative stages is below a 
certain threshold. 
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1. Introduction 

Phase estimation is of fundamental importance to quantum information and quantum 
computation. It is related to some very important problems such as estimating 
eigenvalues [U El El S], the factoring and search algorithms 0, Section 5.3], precision 
measurement of length and optical properties, and clock synchronization [6] . 

Suppose that we have a unitary matrix Uq depending on an unknown parameter 
6 and that one of its eigenvectors \u) is completely known; furthermore Ue acts on \u) 
in the following way: Ug\u) = e^^'^^lw), where 6 G [0, 1). The task of phase estimation 
is to estimate the eigenvalue e*^'^^, and consequently 9, as accurately as possible. In 
this paper we investigate phase estimation of a unitary matrix with known eigenvectors, 
which acts on a 2-dimensional Hilbert space. In particular, we look at unitary matrices 
of the form 



Ue= le , (1) 




where 9 G [0, 1). We will think of 9 as being a point on a circle of unit circumference, and 
confidence intervals for 9 as arcs on a circle of unit circumference, known as confidence 
arcs. We define the distance between an angle 9 and an estimate 9 as 

1^ - ^li = min ({9 - 0),,odi, {9 - ^),,odi) • (2) 



It is important to recognise that the angle 9 is an angle on the circle, and so in this 
paper all arithmetic is modulo 1. 

We now introduce the basic notions of quantum states and POVMs. A quantum 
state is represented by a density matrix p, and satisfies tr{p} = 1, p = p"!", and p > 0. In 
this paper we will mainly be dealing with pure states. Any state which can be written 
as p = \'ip){'ip\ is said to be a pure state; we will often refer to a pure state by its state 
vector Given a quantum state p, we can measure it using a POVM M = {Mm}, 
obtaining outcome m with probability given by the Born rule 

p{m) = tr{pM„}. (3) 

A POVM M = {Mm} satisfies M^ = Ml, > and M^ = I. 

We shall quantify performance of phase estimation schemes in terms of the expected 
fidelity {F{Ug, Ug)) between Ug and Ug. We use the cost function 

1 - {F{Us, Ue)) = 1 - ^ ^ (4) 

and look at its asymptotic scaling with n - the number of times that Ue is used. 

For a simple phase estimation approach where Ug is used once on n identical copies 



of some input state (see Section 2.1) we get 1 — (F) = 0{l/n). This is known as the 
standard quantum limit |6]. However, it has been shown that when n copies of the 
channel are available, [3 El E] we can get 1 — (F) = 0(l/n^). This rate at which 1 — (F) 
approaches zero is known as the Heisenberg limit [IQ\, and cannot be beaten |8j. 
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In this paper we are interested in iterative phase estimation, when we have only 
a single copy of Uq, similar to that of Kitaev [11], for which 1 — (F) = 0{{logn/ny). 
This is within a logarithmic factor of the Heisenberg limit. Note that there are other 
iterative phase estimation procedures with 1 — (F) = 0(l/n^), but they require an extra 
rotation gate capable of doing arbitrary rotations to high precision (see Section 2.5). 

In Section 2 a selection of different phase estimation schemes are described. In 
Section 3 problems with some of the previous methods are explained. Section 4 contains 
our algorithm for phase estimation and a theoretical evaluation of its performance. In 
Section 5 simulations are performed to check that our algorithm works. Section 6 looks 
at the performance of our phase estimation algorithm in the presence of depolarizing 
noise. 



2. Phase estimation methods 



2.1. Simple approach 

A very simple method is to let Ug act on the input state \ipx) = l/v^(|0) + ^^e 
output state is {ipe) = 1/a/2(|0) + e*^'^^|l)). If we measure in x, that is we use the 
POVM = {Ml = \ipx){i;x\,Mo = I - Mi}, then we get outcome 1 with probability 
Px{l', = (1 + cos(27r^^))/2, and outcome with probability Px{0; 9) = 1 — Pxi^', 0). If 
we perform measurements we get an estimate cos(27r6') = 2Nx=i/N — 1 of cos(27r6'), 
where Nx=i is the number of times we obtain the outcome 1. If we measure in y, that 
is, use the POVM My = {Mi = \'^y){'^y\, Mo = I-Mi}, where \ipy) = 1 / V2{\0) + i\l)) , 
we observe outcome 1 with probability Py{l; 6*) = (1 + sin(27r6'))/2, and outcome with 
probability Py{0; 9) = 1 — Py{l] 9). Performing measurements, we get an estimate of 
sin(27r6'); from estimates of cos(27r^) and sin(27r^) we are able to estimate 9. 



2.2. Kitaev 

As far as we are aware, Kitaev [11] gave the first /-stage iterative phase estimation 
procedure. (The number of stages / is chosen before hand, and will depend on the 
precision desired and experimental limitations). At the fcth stage of Kitaev's procedure, 
Uq acts 2^~^ times on a qubit, which is then measured. We perform some multiple of 
log(//e) measurements of (2'''~^6')modi- This ensures that we can "localize each of the 
numbers (2'^^-'^6')jnodi in one of the 8 intervals [(s — l)/8, (s + l)/8] (s = 0, . . . , 7) with 
error probability < e//." Using this information, an algorithm — which is not given — 
gives us an estimate 9 satisfying 

2.3. Rudolph and Grover 

Rudolph and Grover [12] looked at the problem of transmitting a reference frame from 
Alice to Bob, which is linked to estimation of an unknown U G SU{2), parametrized by 
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three parameters a,6,(j). The scheme of Rudolph and Grover involves estimating the 
parameters a, 9, (p individually using the following Z-stage iterative procedure. We set 
6 G [0, 1) to have an infinite binary expansion 6 = wiW2 . . .wi . . .. At the kth stage a 
qubit is sent back and forth between Alice and Bob in such a way that, when Bob finally 
measures it, he obtains outcome 1 with probability Pk{i', 6') = (1 + cos(2'^7r6'))/2. 

This is repeated a minimum of = 321og2(2Z/e) times [12], which ensures that 
Bob's estimate pk{l',0) of pk{l;6) satisfies 

Pr(^(^Pk-^,Pk + \^3Pk^>l-j. (6) 

It is assumed that if \pk — Pk\ < 1/4, then Bob can estimate the kth bit of 6 correctly. 
If this is so, then from ([6]), the probability that Bob estimates the fcth bit of 9 correctly 
is at least 1 — e/l, and the probability that he estimates all of the binary digits of 6 
correctly is at least 1 — e. After / stages, we get an estimate 9 = wiW2 ■ ■ - Wi, satisfying 

P^((^-hJ+^)^e]>l-e. (7) 



2'' 2'^ 

A similar scheme is then used to estimate the parameters a and 0. The method of 
Rudolph and Grover has been used in [H] for the problem of clock synchronization. 



2.4- Zhengfeng et al 

Zhengfeng et al highlighted two errors with the method of Rudolph and Grover: (i) 
knowing \9 — 9\i < 1/2*" does not imply that we know the first m bits of the binary 
expansion of ^ — consider 9 = 0.49, 9 = 0.5 and m = 1, (ii) the method is problematic 
(in the sense explained in section 3) for 9 close to 1/2. 

Zhengfeng et al. gave the following /-stage procedure. In the first stage we let 
Ug act on lipx) and measure in x; we obtain outcome 1 with probability p{l',9) = 
(1 + cos(27r^^))/2. The state Uglipx) is measured some multiple of log(//e) times (A^), 
which gives an estimate 9 satisfying 

Pr((<'-4<' + ^)3<')>l-f (8) 

Having obtained an estimate 9, 

1) if ^ G [0, 5/12), define n = 2 and z/i = 0, 

2) if ^ G [5/12, 7/12), define n = 3 and = 1, 

3) if ^ G [7/12, 1), define n = 2 and ui = 1. 

At the kth stage we apply Ug rir2 ■ ■ .r^^i times. After measuring Ug^'^^"''^''~^\ipx) N 
times, we estimate (rir2 . . . 'rfc_i^^)modi and obtain and z/^ in a similar way to that in 
which we obtained ri and z/i. After / stages we have (ri, . . . , r/, i^i, . . . , z//). Our final 
estimate of 9 is 
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2.5. Dobsicek et al 

A popular iterative estimation method is to take 9 to have a binary expansion of length I 
plus some small remainder, that is 6* = wiW2 . . . + A. The binary digits wi, . . . , are 
measured one at a time with a single measurement. This has been done in [TU [151 IIHl [U] • 
(Higgins et al [17] were the first to give, and carry out experimentally, a method of this 
form which achieves the Heisenberg limit.) We review the method as described by 
Dobsicek et al [U]. 

At the fcth stage we let U^^ '^^^ act on one of two qubits; the other qubit is acted on 
by a Z-rotation gate e*"*""^ before being measured — where = and for k = 2, . . . ,1 
depends on the results from the previous k — 1 stages. From this measurement we get 
an estimate wi+i^k of the (Z + 1 — A;)th binary digit. After I stages we get an estimate 
9 = W1W2 . . .wi oi 9 which satisfies 



We can increase the probability that our final interval contains to 1 — e by either 
(a) increasing the number of rounds to /' = / + log(2 + l/(2e)) or (b) using 0(log^(l/e)) 
extra measurements of the first few binary digits [15]. The method of Dobsicek et al 
has recently been carried out on experimental data in [18]. 

The method of Dobsicek et al [^ has been analysed when there are internal static 
imperfections and residual couplings between qubits [IH] . It was shown that this type of 
noise is detrimental to the performance of Dobsi'cek's method, however, solutions were 
found to overcome this problem in |19j . 

3. Problems 

There is nothing wrong with Kitaev's method of iterative estimation. However, he does 
not give an algorithm for (i) choosing which of the intervals contains (2'^~^^)modi with 
probability 1 — e//, (ii) reconstructing 9 given confidence intervals for (2^^^6')modi- As 
we will see in this section there are gaps in the methods of Rudolph and Grover, and 
Zhengfeng et alioi (i). 

There are two main gaps in Rudolph and Grover's method, which we now explain. 
Firstly, Pfc(l; 9) = (1 + cos(2'^7r0))/2 is a multimodal function of 9. For example, 9 = 3/4 
and 9' = 1/4 give the same value of pi{l; 9), even though they differ in the first binary 
digit. To overcome this, we need an estimate of sin(27r0) as well. This however is a 
trivial point and is easily overcome. 

Secondly, if ^ = 1/2±S, where 6 is small, we require a large number of measurements 
to determine the first bit of 9 correctly with high probability. If we do make a mistake 
then for our final estimate 9 we will have \9 — 9\i > 6. This problem, which occurs for 
9 close to 1/2, was pointed out by Zhengfeng et al. 

A similar problem also occurs for = ± 5. Because of this, we will encounter 
difficulties in estimating the kth bit of 9 whenever (2^~^^^)inodi ~ 0, (2^~^6')modi ~ 1 or 




(10) 
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(2'^~^^^)modi ~ 1/2. However, it may also be possible to overcome this issue using extra 
rotation gates in these cases. 

There are also gaps with the method of Zhengfeng et al p3]. Firstly, like Rudolph 
and Grover, they overlook the fact that 9) = (l + cos(27r6'))/2 is bimodal. Secondly, 
the accuracy of their final estimate relies on the assumption that if \9 — 9\i < 1/12 and 
9 e [0, 5/12), then 9 G [0, 1/2). This is not true, as we could have 9 = -1/12 ^ [0, 1/2). 
Similarly, they assume that if \9 - 9\i < 1/12 and 9 e [7/12, 1), then 9 e [1/2, 1), which 
again is not true, as we could have 9 = 1/12 ^ [1/2, 1). Again we will get problems at 
the kth stage if (ri ■ ■■rk-i9)^odi ~ or (ri ■ ■■rk-i9)^odi ~ 1- 



4. Our approach 

This section contains our method for phase estimation. Firstly, we describe the method 
for going from confidence arcs for 9, (26')modi5 (46')modi5 • • • ? (2'~^6')inodi5 of length 1/3 
and coverage probability at least 1 — e//, to a confidence arc for 9 of length 1/(2'^^ x 3) 
and coverage probability at least 1 — e. Secondly, we describe how to get a confidence 
arc for (2''~^^^)modi, of length 1/3 and coverage probability 1 — e/l. Thirdly, we use 
Hoeffding's inequality to calculate the number of measurements needed at each stage. 
Finally, we show that 1 - {F{U§, Ue)) = 0{(\ogn/nY). 



4.1. The iterative phase estimation algorithm 



In this section we introduce our method for computing a confidence arc for 9 of length 
l/(2'~^ X 3) and coverage probability 1 — e. First we give an intuitive approach using 
examples. For computational simplicity, we look at confidence arcs of length 0.3 and 
coverage probability 1. Lk and Jk will denote confidence arcs for (2^"^6')modi and 2''~^9 
respectively, of length 0.3 and coverage probability 1. ( In our more general algorithm 
and Jk will have length 1/3 and coverage probability at least 1 — e/l.) For the examples 
we choose / = 3. 
Example 

Suppose that after doing some measurements of Ue, f/| and Ug we find 
Li = [0.6,0.9] 9 9 



L2 



[0.3,0.6] 9 (2^)„,odi 
[0.8,1.1] 3 (4^)n,odl. 



It follows from (11) that 

2Li = [1.2,1.8] 3 29. 



Using ([12]) and it follows that 
J2 = [1.3,1.6] 3 29. 



From ( 15 ) we know that 

2J2 = [2.6,3.2] 3 4:9. 



(11) 
(12) 

(13) 
(14) 
(15) 
(16) 
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Using (13) and (16) we get 

Js = [2.8,3.1] 3 AO. 



(17) 



Using confidence arcs ( |ll| ), (12) and (13) for 6, (2^^)inodi and (46')modi respectively, 
of lengtli 0.3 and coverage probability 1, we have derived a confidence arc (17) for 46' of 
length 0.3 and coverage probability 1. From this, we get a confidence arc for 9 of length 
0.3/2^"^ = 0.075 and coverage probability 1, namely 

(1/4) J3 = [0.7,0.775] 3 6. 

Remember that we are looking at confidence arcs on a circle. On the circle the arc 
[1.2, 1.8] is equivalent to the arc [0.2, 0.8], as is [2.2, 2.8], [3.2, 3.8] . . .. Similarly [2.6, 3.2] 
is equivalent to [0.6, 1.2]. 

We define the symbol Ci to mean that a confidence arc on the circle, is a subset 
of another confidence arc on the circle. Similarly, we define Gi to mean that a point 
is contained within an arc on the circle, e.g. 0.3 Gi [1.2,1.8]. The previous example 
was rather simple in that [0.3,0.6] Ci [1.2, 1.8] and [0.8, 1.1] Ci [2.6,3.2]. In general we 
cannot assume that Lfc+i Ci 23^. 



General Algorithm 

Our confidence arcs are now of length 1/3 rather than 0.3. Let us put 

Lfc = [x(A;),x(A;) + l/3], a;(A:) G [0, 1), 
Jk = [z{k),z{k) + 1/3]. 

As in the Example we use 2Jk and Lk+i to find a confidence arc Jk+i- We insist that 
Jk+i C 2Jfe. For this we require that z{k + 1) G [2z{k),2z{k) + 1/3]. Assuming that 
Jk 3 2''^^6 and Lk+i 3 (2'^6')modi then there are three possibilities. For each possibility 
we give a figure, with a small vertical line representing the choice of the lower bound 
for Jfc+i. (Note that Ji = Li.) 

(i) The simplest possibility is that L^+i Ci 2/^. This occurs when {x{k + 
1) — 2z{k))^odi G [0,1/3). We choose Jk+i to have lower boundary z{k + 1) = 
2z{k) + {x{k + 1) - 2z{k))^odi. 



z(k+l) 



2Ju 



Figure 1. Situation 1 



(ii) Another possibility is that x{k + 1) ^1 2Jk but x{k + 1) + 1/3 Gi 2Jk. This 
occurs when {x{k + 1) — 2z{k))^odi ^ [2/3, 1). In this case we take the lower boundary 
of Jk+i to be z{k + 1) = 2z{k). 
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L 



z(k+l) 



23^ 



Figure 2. Situation 2 



L 



Figure 3. Situation 3 



(iii) The final possibility is that x{k + 1) Gi 2Jk but x{k + 1) + 1/3 0i 2Jfc. This 
occurs when {x{k + l) — 2z{k))^o^i G [1/3,2/3). In this case we take the lower boundary 
of Jk+i to be + 1) = 2z{k) + 1/3. 

Using this iterative scheme, we end up with the confidence arc Ji = [z{l), z{l) + 1/3] 
for with coverage probability 1. This immediately gives us a confidence arc for 6 
of length l/(2'-i x 3), namely (l/2'-i) J/ = [z{l)/2^-\ {z{l) + l/3)/2^-^]. We take the 
middle of this interval modulo 1 as our estimate 9 of 9, i.e. {{z{l) + l/6)/2'~^)modi- 

The final confidence arc for 9 of length l/(2'~^ x 3) contains 9 if Lk 3 {2^^^9)nwdi, 
for every k = 1, . . . ,1. If each has coverage probability at least 1 — e/l, the probability 
that every Lk contains (2^^^6')modi is at least 1 — e. 

4-2. Finding Lk 

Here we give the details of calculating the confidence arcs for (2^~^6')jnodi of length 
1/3 and coverage probability at least 1 — e/l. First we show how a confidence arc of 
length 1/3 is computed, then we show how to make the coverage probability at least 
1 — e/l. We shall look at finding a confidence arc for 9. The analysis is exactly the same 
as for (2'^~^6')modi except in the latter case we let Ug act 2'^~^ times on the same \ipx)- 

We let Ug act on lip^) and measure in x. We obtain outcome 1 with probability 
Px{l', 9) = {1 + cos(27r6'))/2. We measure Ug\ipri:) a total of times and obtain outcome 
1 A^j.=i times. We then have an estimate 2Nx=i/N — 1 of cos(2'7r^). 

We let Ug act on \il)x) and measure in y. We obtain outcome 1 with probability 
Py{l]9) = (1 + sin(27r6'))/2. We measure Ug\ipx) in ?/ a total of times and obtain 
outcome 1 iVj,=i times. We then have an estimate 2Ny=i/N — 1 of sin(27r6'). We get an 
estimate 




of 9. We can construct Li as 
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More generally, given an estimate (2 ^A;)modi of (2 ^)modi we get the confidence arc 



x{k),x{k) + ^ I , 



x{k) 



1 

3dl 6 



mod 1 



It is shown in the appendix that if 



and 



N 



Ny=l 



N 



< 0.306 



< 0.306, 



then Li 3 9. It follows that if 

iVx.=i 



and 



then 



Pr 



Pr 



N 

Ny=l 



N 



< 0.306 > Wl - 



< 0.306 > Wl 



(18) 



(19) 



(20) 



(21) 



Pr(Li 3 9)>1 



I' 



(22) 



An analogous result holds for L^, k = 2, . . . ,1. It is shown below that if 



5.341og(4//e) then (20) and (21) hold. 



4-3. Number of measurements needed 
The Hoeffding inequality |2^ will be used. 

Theorem 1 Given independent random variables Xi, . . . ,Xn with ai < Xi < hi, then 
the following inequality holds for the sum Sn = J2i=i-^i- 

2nH^ 



Vi{\Sn-E[Sn] \ > nt) < 2exp 



(23) 



The observed measurement outcomes from a single measurement in x are independent 
random variables Xi with < Xj < 1, and for which Sn = Nx=i and -Ef^n] = npx{l] 9). 



Using ( 23 ) it is straightforward to show that 



Pr 



Px(l; 



>?j < 2exp {-2Nt^) 



From ( 24 ) it can be shown that ( 20 ) holds if 
N = 5.34 In ' 



(24) 



(25) 
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measurements in x are performed at each stage. The analysis is exactly the same for 
measurements in y, and so a total number of 

Ntot = l0.68\n(^^ (26) 



measurements are required at each stage. This ensures that (20) and (21) hold, and 



consequently (22) holds. 



4.4- The behaviour of the fidelity 

We now see how 1 — {F{Uq, Uq)) scales with the number of times Ue is used. As in [12], 
we look at the worst-case value of 1 — {F{Ug, Uq)). That is, if the final confidence arc 
does not contain 6 then 6 = {6 + l/2)modi, and if it does then 9 lies on the boundary of 
the confidence arc, i.e. |^ — ^|i = 1/(2' x 3). This gives 



e + 



7r^ evr^ 



22' X 9 22' X 9' 



If we choose e = 1/22', ^^^^ ^ _ f/^)) = 0(1/22'). rj^^^.^ requires a total of 

A^tot = 10.681n(4Z X 2^') (27) 
measurements at each stage. The number of times Uq is used is ra = Ntot^ — 1), and so 



1/2' ^ Ntot/n. The number of measurements, (27), made at each stage is 0{l)\ noticing 
that logn is also 0(/), it follows that 

l-(F(f/,,[/,))=o(^(^yj. (28) 
5. Simulations 



The analysis in Section 4.4 concentrated on optimizing the worst-case asymptotic scaling 
of 1 — {F) with respect to n. The cost function 1 — (F) is very sensitive to outliers. A 
large number of measurements, ( [27| ), were chosen so that the probability that the final 
interval did not cover 9 was 1/22'. 'jj-^jg ensured that 1 — {F) was within a logarithmic 
factor of the Heisenberg limit. Rather than choosing Ntot large to remove the large 
contribution of outliers, an experimenter may be happy enough if the probability that 9 
is contained by his final confidence arc is greater than some value. This approach does 
not sacrifice precision, but rather an unnecessarily large coverage probability. 

In this section we give a table of simulated results, for different numbers of iterative 
stages, and different numbers of measurements at each stage. From this table we show 
how to calculate a confidence interval for the coverage probability. An experimenter 
who wants a confidence arc for 9 of certain length and coverage probability could look 
at the table and find the number of measurements needed to achieve this. 
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Simulations are performed with the computer package MAPLE. A value for the 
parameter 6 G [0, 1) is given by a random variable with a uniform distribution. 
Measurement results can be simulated, since the number of times outcome 1 is observed 
has a Binomial distribution. For example, at the kth iterative stage, measuring in x, 
Nx=i ~ Bin(A^, (1 + cos(2'^7r^))/2). From the simulated results of measurements in x 
and y for stages 1, . . . , Z, an estimate of 9 is obtained using the iterative algorithm given 
in Section 14.11 We can then test whether our final confidence arc contains 6. This is 
done for 100, 000 randomly chosen 6, and the number of times that 6 is contained by 
the final confidence arc recorded. 

For most recent iterative schemes the total number of iterations is reasonably small: 
6 in [T7] and 7 in [18]. We look at simulations with the number of iterations varying 
between 6 and 9. Table [T] gives the number of times the final confidence arc contains 
the true value of 9. 



Table 1. Numbers of trials out of 100,000 with \0 - 9\ < 1/(2' x 3). 



Number of iterative stages (/) 



20 99,792 99,729 99,747 99,712 

30 99,993 99,987 99,982 99,978 

40 99,999 100,000 99,998 99,999 

50 100,000 100,000 99,999 100,000 



Using the above simulations the coverage probability can be estimated, i.e. the 
probability that, using the iterative algorithm, the known true value 9 is contained in 
our final confidence interval. 

Suppose the true (unknown) coverage probability is p. For the ith trial put 

Wi = 1 if interval covers 9 
= if not. 

Then Wi, . . . , Wm are independent identically distributed Bernoulli random variables, 
i.e. Wi ~ Bin(l,p). Thus 

PFi + ■ ■ ■ + P^Af ~ Bin(M,p). 

If m out of M intervals cover 9 then p is estimated by m/M. An approximate 95% 
confidence interval for p is 



± 1 96i /^/^ " ^1^) 



M \l M 

The longest confidence interval (0.00066) is that for using 9 iterative stages and a total 
of 20 measurements at each stage. Using the half-length of this confidence interval. 
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we can compute a confidence interval from tlie results given in Table [l] with coverage 
probability at least 95%: 
m 

±0.00033. 



100,000 

If an experimenter is content with a confidence arc of length no smaller than 1/(2^ x 3) = 
1/1536 and estimated coverage probability no greater than 99.6% then he need perform 
no more than 20 measurements at each stage. If the experimenter wanted to use even 
less measurements he could produce his own table of simulated results possibly even 
varying the number of measurements performed at each stage. 

6. The noisy case 

It is known that when even a small amount of noise is present the performance of phase 
estimation schemes is greatly reduced [211 122] • This section investigates the performance 
of the iterative estimation algorithm when depolarizing noise is present. The channel 

Po^{l- r)UePoUl + ^la, < r < 1, (29) 
is considered, where Ug is the same as before, ([T]), and po = \4'x){'4^x\- (The channel (29) 



is identical to UgpoUl undergoing phase damping with A = r(2 — r) Op. 383].) Ji et al 
[T3] gave the very interesting result that if r > 0, then the optimal asymptotic rate at 
which 1 — {F{Ug, Ug)) approaches zero is given by the standard quantum limit. 

The whole point of using an iterative scheme is that the distinguishability of 6 from 
cos(?7.27r^), with n » 1, is considerably greater than from cos(27r6'). One measure of 
distinguishability is the Fisher information. Given a family of probability distributions 
with density functions p{x; 6), the Fisher information is defined as 

F-^ jp(.;0)(^-^^)\. (30) 

Intuitively the Fisher information tells us the amount of 'information' about a parameter 
contained in a probability distribution. The Symmetric Logarithmic Derivative (SLD) 
quantum information Hg tells us the maximal attainable Fisher information obtained 
from measuring a state depending on an unknown parameter [23], i.e. 

F^' < Hg. (32) 

The SLD quantum information is defined in terms of the SLD quantum score \g as 

Hg = tl{\gpg\g}, 

where Xg is any self-adjoint solution of the matrix equation 

1 
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To measure distinguishability, the quantity /m will be used, where m is the number 
of times Uq acts on the same input state. This is because of interest is to maximize the 
distinguishability of 9 per use of the channel. 

If there is no noise, and the experimenter lets Ue act m times on the input state and 
measures in then outcome 1 is observed with probability 9) = (l + cos(m27r6'))/2 
and with probability p^(0;6') = 1 — px{^\9). The Fisher information from this 
measurement is Fg^"^ = 47r^m^, which is equal to the SLD quantum information. 
Measuring in y gives the same Fisher information. Thus Fq ^"" /m = Fg ^^ /m = An'^m. 
At the fcth stage of the iterative procedure, we let Uq act m = 2^^^ times on the input 
state, and so F^^^ /m = F^ym = . Thus F^'^ jm (where M is an arbitrary 

measurement in x or y) increases exponentially with k. 

In the noisy case, letting Uq act m times on the output state and measuring in x, 
outcome 1 is observed with probability Px{X] ^) = (1 + (1 — t)™" cos(m27r^))/2 and with 
probability (0; ff) = 1— ^^^(l; 9). Measuring in y, outcome 1 is observed with probability 
Py{l]9) = (l + (l-r)'^sin(m27re))/2 and with probability pj,(0;e) = l-py{l]9). This 
gives 

47r2m2(l - r)2™ sin2(2m7r^) 



1 - (1 -r)2'"cos2(2m7r^) 
47r2m2(l - r)2™ cos2(2m7r^) 



Notice that 



1 - (1 - r)2'^ sin^(2m7r^ 
He =47rW(l-r)2". 



Thus measuring both in x and y, the average Fisher information from a single 
measurement M is approximately H0/2. 

The maximal value of F^ /m, taken over m, will occur close to the maximal value 
of He/m. When r > 0, Ho/m, and hence Fg '^ /m, does not increase indefinitely with m. 
Instead it reaches its maximum at 

m = ^ — ; ^, (33) 

21og(l-r)' ^ ' 

after which it decreases. When r is small, this maximum is obtained at 

m ^ — . (34) 
2r ^ ^ 

The number of stages that can be performed, for small r, such that Hg/m, and hence 

Fq^ /m, increases at each stage is approximately / ~ — logg^. A consequence of this is 

that estimation close to the Heisenberg limit is not possible, asymptotically, when there 

is any depolarizing noise. This gives an alternative insight into the result of Ji et al [13]. 

Figure [4] gives Hg/m at the fcth iterative stage when r = 2~^. It can be seen that 

Hg/m increases up to = 5, decreases slightly near k = Q and falls rapidly for > 6. 

Other figures not included here give similar information, showing Hg/m increasing up 

to = — log2 r, and decreasing rapidly for > — log2 r. 
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Table |2] contains the results of simulations, for magnitudes of noise r = 
2-4^ 2~^, . . . , and total number of iterative stages I = 4, ... ,9. Consider the diagonal 
of Table [2| from r = 2~^, / = 4 to r = 2~^, / = 8. This corresponds to the experimenter 
performing / = — log2 r iterative stages, which involves going up to the iterative stage 
at which Fq^ /m is maximized. Similarly, the diagonal from r = 2"'', / = 5 to r = 2~®, 
/ = 9 corresponds to the experimenter performing I = — log2 r + 1 iterative stages etc. It 
is interesting to note that when I > — log2 r, there is a significant decrease in the number 
of confidence intervals containing 6. If the experimenter performs / = — \0g2r iterative 
stages then the final confidence interval contains 6 approximately 98% of the time; for 
/ = — log2 r + 1 iterative stages, the coverage probability decreases to approximately 
89%. For I = — log2r + 2 iterative stages, the coverage probability is approximately 
61% - a considerable drop in performance. Simulations using more measurements at 
each stage have given similar results. 

It is interesting to see that the drop off in performance, in terms of the coverage 
probability, occurs at the same point as the drop in performance as measured by He/m, 
and consequently Fe/m - seen in Figure |4| 



We suggest, more generally, that for the channel (29) the optimum number of 
iterative stages, where at the kth stage Ug is used 2^^"^ times, is / = [— log2rJ. 

A related question was considered in [5^, where the 'stopping point', was the 
number of entangled photons to be included in the NOON input states. Rubin and 
Kaushik found that the optimal precision in measurement occurred for = 1.279/L, 
where L is the magnitude of loss (analogous to the point, n' = l/(2r), at which Fg ' /m 
is maximized). 



Table 2. Numbers of trials out of 100,000 with \9 - 9\i < 1/(2' x 3), with Ntot = 30. 



Number of iterative stages (/) 



r 


4 


5 


6 


7 


8 


9 


2-4 


98,290 


88,340 


60,423 


32,445 


16,059 


8,042 


2-5 


99,804 


98,408 


88,537 


61,293 


32,756 


16,460 


2-6 


99,967 


99,807 


98,430 


88,708 


61,148 


32,595 


2-7 


99,985 


99,955 


99,802 


98,476 


88,895 


61,699 


2-8 


99,988 


99,977 


99,962 


99,812 


98,467 


88,864 



7. Discussion 

After completing this work we were made aware of similar work already done by Higgins 
et al |25]. In [25] it was shown that the logarithmic factor can be removed, thus achieving 
the Heisenberg limit. Also, an experimental demonstration was given. However, an 
explicit algorithm which allows experimenters to implement this method was not given. 
Furthermore, the problem of noise was not dealt with. 
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8. Conclusion 

In this paper we have shown that there are gaps in the iterative phase estimation schemes 
of [121 US]. 

We have shown how to compute confidence arcs for 6, (26)^odi, (4^) modi) • • • 1 '^/modl) 
of length 1/3 and coverage probabihty at least 1 — e/l. The main contribution of this 
paper has been to give an explicit algorithm which uses these confidence arcs to obtain 
a confidence arc for 6 of length 1/(2'^^ x 3) and coverage probability 1 — e. Choosing 
e = 1/2^' gives 1 — {F{Uq^Uq)) = 0((logn/n)^), i.e. within a logarithmic factor of the 
Heisenberg limit. The advantage of our scheme is that unlike other iterative phase esti- 
mation methods, such as [151 E] ) it does not require an extra rotation gate capable of 
doing arbitrary rotations with almost perfect accuracy. Thus our scheme has a simpler 
experimental setup and less potential for error. 

Using computer simulations we have shown that the algorithm is successful. We 
have suggested the use of tables of simulated results to help choose the number of 
resources needed for desired levels of precision and coverage probabihty. 

We have analysed our estimation scheme in the presence of depolarizing noise with 
magnitude r. We have shown that the iterative algorithm is still successful in this case 
provided that no more than / = — log2 r iterative stages are performed. 
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Appendix 

Put X = cos(27re), y = sin(27r^), xq = 2N^=i/N - 1, yo = 2Ny=i/N - 1, (f){x,y) = 
atan2(y, x) and (j){xo,yo) = atan2(|/o, xq). Define 

A0 = min (^(0 - 0)mod27r, (0 - 0)mod27rj • 

Given a e [0, 1/a/2] and 

|x — xo| < a, (A.l) 
\y-yo\ < a, (A.2) 

then (xo, yo) lies in a square with sides of length 2a centred around (x, y). From simple 
geometry it is obvious that the maximum value of A0 occurs when (xo,i/o) is one of the 
four corners of the square. For this case, consider the triangle given by the points (0, 0), 
(x, y) and (xq, yo). The angle at point (0, 0) is A0, and is opposite a side of length \/2a. 
The angle at point (xo,?/o) is opposite a side of length 1. It follows from the sine rule, 
and monoticity of arcsin on [0,1] that 

A(j) < arcsin (V2a^ . (A.3) 
For the iterative algorithm it is required that A0 < vr/S, which holds if a = 0.612. Then 



(A.l) and (A.2) are equivalent to (18) and (19). 
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